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LONG-TERM  GOALS 

The  long-term  goals  of  this  project  are  to  improve  our  ability  to  monitor  and  predict  the  ocean 
circulation  through  the  assessment  and  design  of  ocean  observing  systems;  and  through  the 
development  of  practical  methods  for  ocean  data  assimilation. 

OBJECTIVES 

We  seek  to  develop  new  approaches,  and  to  apply  existing  methods,  to  the  problem  of  observing 
system  design  and  assessment.  This  involves  the  establishment  of  a  methodology  and  practical 
capability  to  support  the  routine  assessment  of  the  ocean  observing  system;  to  apply  these  capabilities 
to  gain  insight  into  the  potential  benefits  of  future  observation  types  for  data  assimilating  models;  and 
to  support  operational  groups  in  the  design  of  new  observing  systems,  either  for  specific  targeted 
activities  or  for  monitoring  programs. 

APPROACH 

A  range  of  different  approaches  to  observing  system  design  and  assessment  have  been  exploited  under 
this  project.  These  include  traditional  methods,  including  Observing  System  Experiments  (OSEs)  and 
Observing  System  Simulation  Experiments  (OSSEs),  and  new  methods  that  draw  on  aspects  of  data 
assimilation  theory.  We  have  applied  these  tools  to  the  assessment  and  design  of  ocean  observing 
systems  on  a  range  of  scales  and  applications.  This  includes  an  assessment  of  the  relative  importance 
of  different  components  of  the  global  ocean  observing  system  (GOOS),  the  design  of  a  tropical 
mooring  array,  and  an  assessment  of  the  potential  benefits  of  different  shelf  observation  platforms  to  a 
data  assimilating  model.  A  summary  of  OSE  and  OSSE  activities  conducted  under  the  Global  Ocean 
Data  Assimilation  Experiment  (GODAE),  an  international  effort  in  the  development  of  ocean  forecast 
and  analysis  capabilities,  is  described  by  Oke  et  al.  (2009a). 

Several  aspects  of  ensemble  data  assimilation  have  been  explored  under  this  project.  This  includes  an 
effort  to  understand  the  impact  of  using  different  formulations  of  ensemble  square  root  filters,  as  well 
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as  the  development  of  a  new  EnKF  algorithm.  These  activities  have  been  conducted  with  the  aid  of  a 
series  of  small  models. 

Dr.  Peter  Oke  is  the  P.I.  on  this  project  and  leads  the  ocean  data  assimilation  activities  at  the  Centre  for 
Australian  Climate  and  Weather  Research  (CAWCR),  a  partnership  between  CSIRO  and  the  BoM. 
Other  researchers  contributing  to  this  research  include  Dr.  Pavel  Sakov,  now  based  at  NERSC  in 
Norway,  Dr.  Terence  O’Kane  from  CAWCR,  Dr.  Hans  Ngodock  and  Dr.  Gregg  Jacobs  from  NRL. 

WORK  COMPLETED 

Observing  System  Design  and  Assessment 

We  have  exploited  both  OSEs  and  OSSEs  under  this  project.  OSEs  generally  involve  the  systematic 
denial,  or  withholding,  of  different  observation  types  from  a  data  assimilating  model  in  order  to  assess 
the  degradation  in  quality  of  a  forecast  or  analysis  when  that  observation  type  is  not  used.  OSEs  are 
most  suited  to  quantifying  the  benefits  of  existing  observation  types  for  data  assimilation  models. 
Examples  of  OSEs  performed  under  this  project  are  documented  by  Oke  and  Schiller  (2007a). 

OSSEs,  on  the  other  hand,  typically  involve  some  sort  of  twin  experiment  where  synthetic 
observations,  usually  extracted  from  a  model,  are  assimilated  into  an  alternative  model.  OSSEs  are 
therefore  well  suited  to  evaluating  the  potential  benefits  of  hypothetical  observations  that  may  not  exist 
yet.  Thus,  these  methods  can  be  used  to  contribute  to  the  design  of  future  observing  systems, 
quantifying  their  possible  impacts  and  limitations.  Examples  of  OSSEs  performed  under  this  project 
are  documented  by  Oke  et  al.  (2007b).  These  examples  contributed  to  the  design  of  the  tropical  Indian 
Ocean  mooring  array. 

A  new  capability  has  been  developed  under  this  project  that  is  built  on  the  work  of  Bishop  et  al. 

(2001).  The  method  seeks  to  identify  the  set  of  observations  that  minimise  the  analysis  error  variance 
for  a  predefined  variable  or  quantity,  given  an  ensemble  of  perturbations  that  represent  the  system's 
background  error  covariance.  The  approach  is  very  general  and  can  be  applied  to  either  a  stationary  or 
time-varying  ensemble.  Applications  using  a  stationary  ensemble  are  most  applicable  to  models  using 
a  3d  assimilation  system,  like  3d-Var  or  EnOI.  Applications  to  a  time-varying  system  are  most 
applicable  to  models  using  a  4d  assimilation  system,  like  4d-Var  or  the  ensemble  Kalman  Filter 
(EnKF).  This  method  is  documented  by  Sakov  and  Oke  (2008a)  and  has  been  used  to  contribute  to  the 
design  and  assessment  of  the  New  South  Wales  (NSW)  node  of  the  Australian  Integrated  Marine 
Observing  system  (IMOS).  The  application  to  NSW  IMOS  is  documented  by  Oke  et  al.  2009b). 

Data  Assimilation 

Under  this  project,  a  fundamental  exploration  of  ensemble  data  assimilation  has  also  been  undertaken. 
The  significance  of  this  work  is  best  understood  in  light  of  the  history  of  ensemble  data  assimilation 
that  is  summarized  below.  The  ensemble  Kalman  filter  (EnKF)  was  first  proposed  by  Evensen  (1994). 
The  idea  of  the  EnKF  was  that  an  ensemble  of  model  perturbations  is  integrated  -  implicitly 
representing  the  system’s  background  error  covariance.  This  covariance  is  used  to  assimilation 
observations.  At  each  analysis  step  the  ensemble  is  updated,  implicitly  representing  the  system’s 
analysis  error  covariance.  Conceptually,  the  framework  for  the  EnKF  has  not  changed  since  its 
conception.  However,  the  formulation  proposed  by  Evensen  was  flawed,  and  the  ensemble  typically 
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collapsed  owing  to  the  under-representation  of  the  ensemble's  analysis  error  covariance.  Subsequently, 
Burgers  et  al.  (1998)  proposed  a  different  fonnulation  of  the  EnKF  where  observations  are  treated  as 
random  variables,  and  are  perturbed.  In  theory,  Burgers'  approach,  known  as  the  perturbed  obs  EnKF, 
should  yield  an  ensemble  with  the  correct  analysis  error  variance.  But  this  approach  is  statistical  and 
therefore  suffers  from  sampling  error  -  particularly  for  small  ensemble  size.  This  renders  the  perturbed 
obs  EnKF  sub-optimal  in  practice.  Ensemble  square  root  filters  (ESRFs;  e.g.,  Tippett  et  al.  2003)  then 
became  popular  in  ocean  and  atmospheric  data  assimilation  because  they  are  deterministic.  They  don’t 
suffer  from  sampling  error  and  are  optimal  for  models  where  the  ensemble  size  exceeds  the  dimension 
of  the  system's  sub-space.  Most  of  the  development  of  ensemble-based  methods  has  been  conducted  on 
small  models  (e.g.,  Lorenz  and  Emanuel  1998).  However,  when  applying  these  methods  to  realistic 
models,  for  example  general  circulation  models,  in  practice  all  ensemble-based  filters  are  rank- 
deficient.  That  is,  the  ensemble  size  is  typically  much  less  than  the  dimension  of  the  model's  sub-space. 
So  there  aren’t  enough  degrees  of  freedom  in  the  ensemble  to  adequately  span  the  model's  error  sub¬ 
space.  This  problem  is  overcome  by  using  covariance  localisation  (e.g.,  Houtekamer  and  Mitchell 
1998).  Localisation  is  traditionally  applied  in  grid  point  space.  This  is  readily  applied  to  the  perturbed 
obs  EnKF,  but  cannot  formally  be  applied  to  ESRFs.  Instead  localised  ESRFs  are  often  implemented 
by  assimilating  only  a  sub-set  of  observations,  local  to  each  grid  point,  and  neglecting  observations 
outside  a  certain  search  radius  (e.g.,  Anderson  2003;  Ott  et  al.  2004).  This  approach  is  practical,  but  is 
not  faithful  to  the  theory  that  underpins  ESRFs.  Under  this  project,  we  have  developed  a  new  filter  that 
is  both  detenninistic,  like  the  ESRFs,  and  readily  pennits  localisation  in  grid-point  space,  like  the 
perturbed  obs  EnKF.  This  filter  is  called  the  Deterministic  EnKF  (DEnKF).  The  DEnKF  combines  the 
numerical  effectiveness,  simplicity  and  versatility  of  the  perturbed  obs  EnKF  with  the  optimal 
performance  of  the  ESRFs.  The  DEnKF  is  documented  by  Sakov  and  Oke  (2008b). 

We  have  investigated  the  impacts  of  using  different  ensemble  transfonnations  in  ESRFs.  It  is  found 
that  only  a  sub-set  of  ensemble  filters  should  be  used  in  practice,  and  that  some  formulations 
significantly  degrade  the  filter  performance.  This  work  is  documented  by  Sakov  and  Oke  (2008c). 

RESULTS 

Observing  system  Assessment 

A  series  of  OSEs  have  been  performed  using  the  Bluelink  ReANalysis  (BRAN;  Oke  et  al.  2008) 
system.  The  Bluelink  ocean  data  assimilation  system  (BODAS;  Oke  et  al.  2005;  2008)  that  underpins 
BRAN  is  based  on  Ensemble  Optimal  Interpolation  (EnOI).  EnOI  is  well  suited  to  OSEs  -  it  is 
multivariate,  using  observations  of  one  type  to  update  variables  of  all  types,  and  readily  assimilates 
observations  of  different  types  in  a  single  step.  An  example  of  the  multivariate  nature  of  EnOI  is 
presented  in  Figure  1  showing  the  sea-surface  temperature  (SST),  sea-level-anomaly  (SLA)  and  sub¬ 
surface  temperature  (T)  increments  when  different  observations  are  assimilated.  For  this  example,  a 
series  of  warm-core  and  cold-core  eddies  are  evident  in  the  SLA  observations,  but  only  a  cold-core 
eddy  is  evident  in  the  SST  observations.  This  can  occur  when,  for  example,  the  wann-core  eddies  are 
capped  by  cold  near-surface  waters.  As  a  result,  we  find  that  when  only  altimetry  is  assimilated  (Figure 
la,d,g)  the  increments  reflect  both  the  warm-core  and  cold-core  eddies,  with  a  clear  surface-expression 
of  the  eddies  in  the  SST  increments.  By  contrast,  when  only  SST  is  assimilated  (Figure  lb,e,h),  the 
increments  reflect  only  cold-core  eddies,  plus  a  general  T  decrease  over  most  of  the  region  shown. 
When  both  altimetry  and  SST  are  assimilated  (Figure  lc,  f,  i),  the  increments  reflect  both  the  wann- 


3 


core  and  cold-core  eddies,  as  well  as  the  surface  T  decrease.  This  example  demonstrates  the 
importance  of  different  types  of  observation  for  assimilation. 
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Figure  1:  Examples  of  (a,  b,  c)  SST  increments,  (d,  e,f)  SLA  increments,  and  (g,  h,  i)  temperature 
increments  for  a  longitude-depth  section  at  24.4° S  (sections  denoted  in  a-c)  when  only  altimetry  (a, 
d,  g)  only  SST  (b,  e,  h)  and  both  altimetry  and  SST  (c,f,  i)  observations  are  assimilated. 

[Figure  showing  impact  of  different  observation  types] 

The  OSEs  presented  by  Oke  and  Schiller  (2007)  include  6  month  data  assimilating  runs  with  different 
observation  types  assimilated,  including:  ALL  (Argo+SST+ALTIM),  NONE  (no  assimilation), 
Argo+SST,  ALTIM+SST,  and  ALTIM+Argo.  The  RMS  difference  between  the  observed  and 
reanalysed  SLA  and  SST  for  each  OSE  is  presented  in  Ligure  2  and  Ligure  3  respectively.  Reanalysed 
SLA  fields  are  compared  to  along-track  SLA  (atSLA)  from  all  available  altimeters  (Jason,  Envisat  and 
GEO).  Reanalysed  SST  fields  are  compared  to  SST  from  AMSR-E.  Lor  comparison,  the  observed 
standard  deviations  are  also  shown  in  Ligures  2  and  3  (for  SST,  this  is  based  on  anomalies  from  the 
seasonal  cycle).  The  area-averaged  RMS  difference  between  observed  and  reanalysed  sub-surface  T 
and  salinity  (S)  in  the  Australian  region  for  each  OSE  is  presented  in  Ligure  4. 
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The  OSE  that  assimilate  all  observations  perform  the  best  overall,  with  relatively  small  residuals  for  all 
variables.  By  contrast  each  OSE  that  assimilates  only  two  of  the  three  observation  types  performs 
poorly  for  at  least  one  variable.  For  example,  when  only  Argo  and  SST  are  assimilated,  the  residuals  of 
SLA  are  significantly  larger  than  ALL  (Figure  2).  Similarly,  when  only  altimetry  and  SST  are 
assimilated,  the  residuals  in  sub-surface  T  and  S  are  large  compared  to  ALL  (Figure  4);  and  when  only 
altimetry  and  Argo  are  assimilated,  the  residuals  for  SST  become  comparable,  in  some  locations,  to 
those  in  NONE  (Figure  3). 

Based  on  the  OSEs  presented  by  Oke  and  Schiller  (2007),  we  conclude  that  each  observation  type 
brings  complementary  information  to  the  GOOS.  This  result  sends  a  clear  message  to  policy  makers 
that  all  observation  types  are  needed  to  constrain  mesoscale  ocean  models  either  for  reanalysis  or 
prediction. 


RMS  residuals  (cm) 


Figure  2:  RMS  residuals  between  observed  and  modelled  SLA  for  each  OSE;  and  the  observed 
standard  deviation  (bottom  right).  Statistics  are  computed  using  atSLA  observations  from  Jason, 
Envisat,  and  GFO  for  the  period  January-May  2006. 

[Figure  showing  relative  SLA  errors  in  an  eddy-resolving  model  run  when  different  observation 

types  are  assimilated] 
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Figure  3:  RMS  residuals  between  observed  and  modelled  SST  for  each  OSE;  and  the  observed 
standard  deviation  (bottom  right).  Statistics  are  computed  using  AMSRE  observations  for  the  period 

January-May  2006. 

[Figure  showing  relative  SST  errors  in  an  eddy-resolving  model  run  when  different  observation 

types  are  assimilated] 
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Figure  4:  RMS  residuals  between  observed  and  modelled  temperature  (left)  and  salinity  (right),  for 
each  OSE.  Statistics  are  computed  for  the  region  90-1 80° E  and  south  of  the  equator  using  3159 

Argo  profiles  for  the  period  January-May  2006. 

[Figure  showing  relative  temperature  and  salinity  errors  in  an  eddy-resolving  model  run  when 

different  observation  types  are  assimilated] 


Observing  System  Design 

The  versatility  of  the  ensemble-based  method  for  observing  system  design  developed  here  (Sakov  and 
Oke  2008a)  is  demonstrated  in  the  published  literature  through  applications  to  the  design  of  the 
tropical  Indian  Ocean  (TIO;  Sakov  and  Oke  2008a)  and  the  NSW  IMOS  shelf  observation  program 
(Oke  et  al.  2009b). 

The  application  to  the  TIO  (Sakov  and  Oke  2008a)  shows  that  the  proposed  TIO  mooring  array  should 
produce  results  that  compare  well  to  the  objectively  designed  arrays  for  intraseasonal  variability.  This 
study  also  identified  regions  in  the  east  and  northeast  of  the  TIO  and  south  of  India  as  the  most 
important  regions  for  monitoring  intraseasonal  variability  in  the  TIO.  Observations  near  9°S,  where 
seasonal  Rossby  waves  are  prevalent,  are  also  identified  as  key  regions  for  monitoring  seasonal-to- 
interannual  variability. 

The  results  of  the  application  to  NSW  IMOS  (Oke  et  al.  2009b)  demonstrate  that  if  high  frequency 
(HF)  radar  observations  are  assimilated  along  with  the  standard  components  of  the  GOOS,  the  analysis 
errors  are  likely  to  reduce  by  as  much  as  80%  for  velocity  and  60%  for  T,  S  and  sea-level  in  the 
vicinity  of  the  observations.  This  is  an  important  result  for  Bluelink,  since  it  impacts  the  planning  of 
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future  Bluelink  activities  and  helps  set  priorities  for  the  uptake  of  new  data  types  into  the  operational 
forecast  system. 

Data  Assimilation 

Sakov  and  Oke  (2008b)  present  a  number  of  experiments  to  compare  the  performance  of  the  DEnKF 
with  both  the  perturbed  obs  EnKF  and  an  ESRF  using  three  small  models.  It  is  shown  that  the  DEnKF 
performs  as  well  as  the  ESRF  and  is  a  significant  improvement  over  the  perturbed  obs  EnKF.  It  is  also 
shown  that  under  many  circumstances,  the  DEnKF  is  more  robust  than  ESRFs  to  mis-specification  of 
tuning  parameters,  like  inflation. 

The  paper  by  Sakov  and  Oke  (2008c)  considers  implications  of  different  forms  of  the  ensemble 
transformation  in  the  ensemble  square  root  filters  (ESRFs)  for  the  performance  of  ESRF -based  data 
assimilation  systems.  They  show  that  it  is  important  to  use  a  mean-preserving  solution  for  the 
ensemble  transfonn  matrix  in  ESRFs.  Not  doing  so  seriously  degrades  the  performance  of  the  ESRF. 

IMPACT/APPLICATIONS 

The  tools  for  objective  array  design  that  have  been  developed  under  this  project  are  very  powerful. 
Given  a  time  series,  or  model  ensemble,  of  oceanic  fields  for  some  region,  these  tools  facilitate  the 
efficient  design  and  assessment  of  an  observation  array  for  that  region.  These  calculations  can  be 
readily  perfonned  in  real  time,  together  with  an  EnKF,  to  undertake  adaptive  sampling.  Our 
experiments  indicate  that  the  most  critical  aspect  of  any  application  is  formulating  the  cost  function. 
That  is,  in  determining  exactly  what  it  is  that  we  wish  to  monitor.  For  example,  an  “optimal”  array  for 
monitoring  intraseasonal  mixed  layer  depth  is  likely  to  be  very  different  from  an  “optimal”  array  for 
monitoring  interannual  variability  (Sakov  and  Oke  2008a).  The  results  from  the  application  of  the 
ensemble-based  method  for  observing  system  design  to  NSW  IMOS  have  been  used  by  the  Bluelink 
science  team  to  set  priorities  on  the  uptake  on  new  observation  types  into  the  operational  Bluelink 
forecast  system. 

The  results  from  the  OSEs  presented  by  Oke  and  Schiller  (2007)  provide  an  important  assessment  of 
the  perfonnance  of  the  GOOS  for  a  data-assimilating,  eddy-resolving  ocean  forecast  system.  Such 
information  is  essential  for  planning  investment  in  the  maintenance  and  development  of  the  GOOS. 
The  finding  that  different  observation  types  provide  complimentary  information  is  an  important  result 
that  should  help  persuade  policy  makers  to  maintain  all  of  the  observation  programs. 

Sakov  and  Oke  (2008c)  demonstrate  that  only  mean-preserving  ensemble  transformations  should  be 
used  by  ESRFs.  This  is  a  very  clear  result,  based  on  both  theoretical  and  experimental  evidence.  This 
development  has  been  adopted  by  the  EnKF  community  (see 

http://enkf.nersc.no/Code/Analysis/meanpres.pdf).  The  development  of  the  DEnKF  provides  a 
computationally  efficient  alternative  to  the  ensemble  data  assimilation  community.  The  uptake  of  the 
DEnKF  has  steadily  increased  since  the  publication  of  Sakov  and  Oke  (2008b). 

RELATED  PROJECTS 

Bluelink  is  a  partnership  between  CSIRO,  the  Bureau  of  Meteorology  and  the  Royal  Australian  Navy. 
Many  of  the  research  activities  undertaken  in  Bluelink  have  strong  synergies  for  the  project  that  is  the 
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subject  of  this  annual  report.  The  main  objective  of  Bluelink  is  the  development  and  application  of  an 
ocean  forecast  system  for  the  mesoscale  circulation  around  Australia.  Applications  of  the  Bluelink 
system  are  well  documented  (e.g.,  Oke  et  al.  2005;  2008;  2009c;  Schiller  et  al.  2008).  An  example  of 
results  from  the  latest  Bluelink  reanalysis  experiment  is  presented  in  Figure  5. 

The  Australian  Integrated  Marine  Observing  System  (IMOS)  program  (www.imos.org.au).  IMOS 
involves  the  provision  of  observational  platforms  (e.g.,  gliders,  high-frequency  radars,  moorings)  to 
establish  a  long-term  monitoring  capability  for  the  oceans  around  Australia. 
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Figure  5:  Monthly  mean  sea-level  from  the  latest  Bluelink  Reanalysis,  with  surface  drifter  velocities 

and  trajectories  overlaid. 

[Figure  showing  good  agreement  between  reanalyzed  SLA  and  independent  surface  drifter 

trajectories  in  the  latest  Bluelink  Reanalysis] 
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